1 Introduction

Combining measurements which have theoretical uncertainties is a delicate matter. Indeed, we are often in the position in which insufficient information is provided to form a completely principled combination. However, we have developed a procedure which tends to avoid the worst pitfalls. We describe this procedure here.

Suppose we are given two measurements, with results expressed in the form:

  Â ± σ_A ± t_A,
  B̂ ± σ_B ± t_B.   (1)

Assume that Â has been sampled from a probability distribution of the form p_A(Â; Ā, σ_A), where Ā is the mean of the distribution and σ_A is the standard deviation. We make the corresponding assumption for B̂. The t_A and t_B uncertainties in Eq. 1 are the theoretical uncertainties. We may not need to know exactly what that means here, except that the same meaning should hold for both t_A and t_B. We suppose that both Â and B̂ are measurements of the same quantity of physical interest, though possibly with quite different approaches. The question is: how do we combine our two measurements?

Let the physical quantity we are trying to learn about be denoted θ. Given the two results Â and B̂, we wish to form an estimator θ̂ for θ, with statistical and theoretical uncertainties expressed separately in the form:

  θ̂ ± σ ± t.   (2)

The quantities θ̂, σ, and t are to be computed in terms of Â, B̂, σ_A, σ_B, t_A, and t_B.

2 Forming the Weighted Average

In the absence of theoretical uncertainties, we would normally combine our measurements according to the weighted average:

  θ̂ = (Â/σ_A² + B̂/σ_B²) / (1/σ_A² + 1/σ_B²) ± 1/√(1/σ_A² + 1/σ_B²).   (3)

For simplicity, we are assuming here that there is no correlation between the measurements. In general, Â and B̂ will be biased estimators for θ:

  Ā = θ + b_A,
  B̄ = θ + b_B,   (4)
where b_A and b_B are the biases. We adopt the point of view that the theoretical uncertainties t_A and t_B are estimates related to the possible magnitudes of these biases. That is,

  t_A ≈ |b_A|,
  t_B ≈ |b_B|.   (5)

We wish to have t represent a similar notion. Without yet specifying the weights, assume that we continue to form θ̂ as a weighted average of Â and B̂:

  θ̂ = (w_A Â + w_B B̂) / (w_A + w_B),   (6)

where w_A and w_B are the non-negative weights. The statistical error on the weighted average is computed according to simple error propagation on the individual statistical errors:

  σ² = (w_A² σ_A² + w_B² σ_B²) / (w_A + w_B)².   (7)

The bias for θ̂ is:

  b = ⟨θ̂⟩ − θ = (w_A b_A + w_B b_B) / (w_A + w_B).   (8)

If the theoretical uncertainties are regarded as estimates of the biases, then the theoretical uncertainty should be evaluated with the same weighting:

  t = (w_A t_A + w_B t_B) / (w_A + w_B).   (9)

It may be noted that this possesses desirable behavior in the limit where the theoretical uncertainties are identical (completely correlated) between the two measurements: the theoretical uncertainty on θ̂ is in this case the same as t_A = t_B; no reduction is attained by having multiple measurements.

However, it is not quite true that the theoretical uncertainties are being regarded as estimates of bias. The t_A and t_B provide only estimates for the magnitudes, not the signs, of the biases. Eq. 9 holds when the biases are of the same sign. If the biases are of opposite sign, then we obtain

  t = |w_A t_A − w_B t_B| / (w_A + w_B).   (10)

Thus, our formula 9 breaks down in some cases. For example, suppose the theoretical uncertainties are completely anticorrelated. In the case of equal weights, the combined theoretical uncertainty should be zero, because the two uncertainties exactly cancel in the combined result. Only a statistical uncertainty remains.
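As a concrete sketch of Eqs. 6-9 (the function and variable names below are my own, not from the text), the two-measurement average with caller-supplied weights can be written as:

```python
import math

def weighted_average(A, B, sigA, sigB, tA, tB, wA, wB):
    """Weighted average of two measurements: statistical errors are
    propagated in quadrature (Eq. 7), theoretical uncertainties are
    averaged linearly with the same weights (Eq. 9)."""
    wsum = wA + wB
    theta = (wA * A + wB * B) / wsum                              # Eq. 6
    sigma = math.sqrt(wA**2 * sigA**2 + wB**2 * sigB**2) / wsum   # Eq. 7
    t = (wA * tA + wB * tB) / wsum                                # Eq. 9
    return theta, sigma, t

# Completely correlated theory errors (tA = tB) with equal weights:
# the combined t is not reduced, as the text notes.
theta, sigma, t = weighted_average(1.0, 3.0, 0.5, 0.5, 0.2, 0.2, 1.0, 1.0)
# theta = 2.0, sigma = 0.5/sqrt(2) ≈ 0.354, t = 0.2
```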
Unfortunately, we don't always know whether the biases are expected to have the same sign or opposite sign. As a default, we adopt the procedure of Eq. 9. In the case of similar measurements, we suspect that the biases will often have the same sign, in which case we make the right choice. In the case of quite different measurements, such as inclusive and exclusive measurements of |V_ub|, there is no particular reason to favor either relative sign; we simply don't know. The adopted procedure has the property that it errs on the side of conservatism: we will sometimes overestimate the theoretical uncertainty on the combined result.

There is still a further issue. The results of the measurements themselves can provide information on what the theoretical uncertainty could be. Consider two measurements with negligible statistical uncertainty. Then the difference between the two measurements is the difference between the biases. If the measurements are far apart, on the scale of the theoretical uncertainties, then this is evidence that the theoretical uncertainties are of opposite sign. We make no attempt to incorporate this information, again erring on the conservative side.

We turn to the question of the choice of weights w_A and w_B. In the limit of negligible theoretical uncertainties we want to have

  w_A = 1/σ_A²,   (11)
  w_B = 1/σ_B².   (12)

Using these as the weights in the presence of theoretical uncertainties can lead to undesirable behavior. For example, suppose t_A ≫ t_B and σ_A ≪ σ_B. The central value computed with only the statistical weights ignores the theoretical uncertainty: a measurement with small theoretical uncertainty may be given little weight compared to a measurement with very large theoretical uncertainty. While not wrong, this does not make optimal use of the available information. We may invent a weighting scheme which incorporates both the statistical and theoretical uncertainties, for example combining them in quadrature:

  w_A = 1/(σ_A² + t_A²),
  w_B = 1/(σ_B² + t_B²).   (13)

Any such scheme can lead to unattractive dependence on the way measurements may be associatively combined. We nevertheless adopt this procedure, with the understanding that it is best to go back to the original measurements when combining results, rather than making successive combinations.
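The non-associativity warned about here is easy to exhibit numerically. The following sketch (my own naming; the measurements are invented for illustration) uses the quadrature weights of Eq. 13 and compares combining three measurements at once against combining two of them first:

```python
import math

def combine(measurements):
    """Each measurement is (x, sigma, t); weights w = 1/(sigma^2 + t^2)
    as in Eq. 13; combined (theta, sigma, t) as in Eqs. 6, 7, 9."""
    w = [1.0 / (s**2 + t**2) for _, s, t in measurements]
    wsum = sum(w)
    theta = sum(wi * x for wi, (x, _, _) in zip(w, measurements)) / wsum
    sigma = math.sqrt(sum(wi**2 * s**2
                          for wi, (_, s, _) in zip(w, measurements))) / wsum
    t = sum(wi * ti for wi, (_, _, ti) in zip(w, measurements)) / wsum
    return theta, sigma, t

a, b, c = (1.0, 1.0, 0.5), (2.0, 1.0, 1.0), (4.0, 2.0, 0.5)
all_at_once = combine([a, b, c])
successive = combine([combine([a, b]), c])
# The two orderings give different central values and different combined
# theoretical uncertainties, which is why it is best to return to the
# original measurements when averaging.
```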
3 Inconsistent Inputs

It may happen that our measurements are far enough apart that they appear inconsistent in terms of the quoted uncertainties. Our primary goal in this analysis is to test consistency between available data and the standard model, including whatever theoretical uncertainties exist in the comparison. We prefer to avoid making erroneous claims of inconsistency, even at the cost of some statistical power. Thus, we presume that when two measurements of what is assumed to be the same quantity appear inconsistent, something is wrong with the measurements or with the theoretical uncertainties in the computation. If we have no good way to determine in detail where the fault lies, we adopt a method similar to that used by the Particle Data Group (PDG) to enlarge the stated uncertainties.

Given our two measurements as discussed above, we define the quantity:

  χ² ≡ w_A (Â − θ̂)² + w_B (B̂ − θ̂)².   (14)

In the limit of purely statistical and normal errors, this quantity is distributed according to a chi-square with one degree of freedom. In the more general situation here, we don't know its detailed properties, but we nonetheless use it as a measure of the consistency of the results, in the belief that the procedure we adopt will still tend to err toward conservatism. If χ² ≤ 1, the measurements are deemed consistent. On the other hand, if χ² > 1, we call the measurements inconsistent, and apply a scale factor to the errors in order to obtain χ² = 1. We take the point of view that we don't know which measurement (or both) is flawed, or whether the problem is with the statistical or theoretical error evaluation. If we did have such relevant information, we could use it in a more informed procedure. Thus, we scale all of the errors (σ_A, σ_B, t_A, t_B) by a factor:

  S = √χ².   (15)

This scaling does not change the central value of the averaged result, but does scale the statistical and theoretical uncertainties by the same factor.
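A minimal sketch of this PDG-style rescaling for two measurements (identifiers are mine; weights are taken as the quadrature weights of Eq. 13):

```python
import math

def rescale_if_inconsistent(A, sigA, tA, B, sigB, tB):
    """Compute the average, the consistency chi-square of Eq. 14, and
    the scale factor of Eq. 15; return the scaled errors."""
    wA, wB = 1.0 / (sigA**2 + tA**2), 1.0 / (sigB**2 + tB**2)
    theta = (wA * A + wB * B) / (wA + wB)
    chi2 = wA * (A - theta)**2 + wB * (B - theta)**2   # Eq. 14
    S = math.sqrt(chi2) if chi2 > 1.0 else 1.0         # Eq. 15
    # Scaling all four errors by S rescales both weights by 1/S^2,
    # leaving their ratio, and hence the central value, unchanged;
    # chi-square recomputed with the scaled errors would be 1.
    return theta, S, (S * sigA, S * sigB, S * tA, S * tB)
```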
4 Relative Errors

We often are faced with the situation in which the uncertainties are relative, rather than absolute. In this case, the model in which θ is a location parameter of a Gaussian distribution breaks down. However, it may be a reasonable approximation to continue to think in terms of this model, with some modification to mitigate bias. We also continue to work in the context of a least-squares minimization, although it might be interesting to investigate a maximum-likelihood approach. I believe that the approach suggested here is consistent with the proposal Bob Kowalewski is making for HFAG averages.
Thus, suppose we have additional experimental uncertainties s_A and s_B, which scale with θ:

  s_A = r_A θ,  s_B = r_B θ.   (16)

If s_k is what we are given, we infer the proportionality constants according to r_A = s_A/Â and r_B = s_B/B̂. The weights that are given in Eq. 13 are modified to incorporate this new source of uncertainty according to:

  w_A = 1/(σ_A² + (r_A θ̂)² + t_A²),
  w_B = 1/(σ_B² + (r_B θ̂)² + t_B²).   (17)

Note that, as we don't know θ, we use θ̂ instead. This means that the averaging process is now iterative, until convergence to a particular value of θ̂ is obtained.

Likewise, there may be a theoretical uncertainty which scales with θ, and we may treat this similarly. Thus, suppose that, for example, t_A² = t_aA² + t_rA², where t_aA is an absolute uncertainty, and t_rA = ρ_A θ. We simply replace θ by θ̂ and substitute this expression wherever t_A appears, e.g., in Eq. 17. That is:

  w_A = 1/(σ_A² + (r_A θ̂)² + t_aA² + (ρ_A θ̂)²).   (18)

5 Summary of Algorithm

We summarize the proposed algorithm. Suppose we have n measurements {x_i | i = 1, 2, ..., n} with error matrix

  M_ij ≡ ⟨(x_i − x̄_i)(x_j − x̄_j)⟩,   (19)

and mean values

  x̄_i = θ + b_i.   (20)

Note that, in the non-correlated case, M_ij = σ_i² δ_ij, or, including relative uncertainties, M_ij = δ_ij (σ_i² + r_i² x̄_i²). The parameter we are trying to learn about is θ, and b_i is the bias that is being estimated with theoretical uncertainty t_i. The present notion of the weighted average is that we find a θ which minimizes:

  χ² = Σ_{i,j} (x_i − θ) W_ij (x_j − θ).   (21)

This is based on the premise that we don't actually know what the biases are, and we do the minimization with zero bias in the (x − θ) dependence. The possible size of bias is taken into account in the weighting, giving more weight to those measurements in which the size of the bias is likely to be smaller. The weight matrix W in principle could be taken to be:

  (W⁻¹)_ij = M_ij + t_i t_j.   (22)

That is, W⁻¹ is an estimate for

  ⟨(x_i − θ)(x_j − θ)⟩ = M_ij + b_i b_j.   (23)

However, we don't assume that we know the relative signs of b_i and b_j. Hence, the off-diagonal t_i t_j terms in Eq. 22 could just as likely enter with a minus sign. We therefore use the weight matrix:

  (W⁻¹)_ij = M_ij + t_i² δ_ij.   (24)

If we do know the relative signs of the biases, for example because the theoretical uncertainties are correlated, then the off-diagonal terms in Eq. 22 should be included, with the appropriate sign.

Setting dχ²/dθ|_{θ=θ̂} = 0 gives the central value ("best" estimate):

  θ̂ = Σ_{i,j} W_ij x_j / Σ_{i,j} W_ij.   (25)

The statistical uncertainty is

  σ² = Σ_{i,j} (W M W)_ij / (Σ_{i,j} W_ij)².   (26)

Note that this reduces to

  σ² = 1 / Σ_{i,j} (M⁻¹)_ij,   (27)

in the case of only statistical uncertainties. The theoretical uncertainty is

  t = Σ_{i,j} W_ij t_j / Σ_{i,j} W_ij,   (28)

where

  t_j² ≡ t_aj² + (ρ_j θ̂)².   (29)

Finally, if χ² > n − 1, these error estimates are scaled by a factor:

  S = √(χ²/(n − 1)),   (30)

where χ² here is the value after the minimization.
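The summary algorithm, including the iteration over θ̂ required when relative uncertainties are present, can be sketched as follows (all identifiers are my own; a sketch, not a definitive implementation):

```python
import numpy as np

def combine_n(x, M, t_abs, rho=None, tol=1e-12):
    """Sketch of the summary algorithm, Eqs. 19-30.

    x: n measured values; M: statistical covariance matrix; t_abs:
    absolute theoretical uncertainties t_a; rho: relative theoretical
    uncertainties (t_r = rho * theta), defaulting to zero. Returns the
    central value and the scaled statistical/theoretical uncertainties.
    """
    x, M = np.asarray(x, float), np.asarray(M, float)
    t_abs = np.asarray(t_abs, float)
    n = len(x)
    rho = np.zeros(n) if rho is None else np.asarray(rho, float)
    theta = x.mean()  # starting point; iterate since weights depend on theta
    for _ in range(200):
        t = np.sqrt(t_abs**2 + (rho * theta)**2)   # Eq. 29
        W = np.linalg.inv(M + np.diag(t**2))       # Eq. 24
        theta_new = (W @ x).sum() / W.sum()        # Eq. 25
        converged = abs(theta_new - theta) < tol
        theta = theta_new
        if converged:
            break
    sigma = np.sqrt((W @ M @ W).sum()) / W.sum()   # Eq. 26
    t_comb = (W @ t).sum() / W.sum()               # Eq. 28
    chi2 = (x - theta) @ W @ (x - theta)           # Eq. 21
    S = np.sqrt(chi2 / (n - 1)) if chi2 > n - 1 else 1.0   # Eq. 30
    return theta, S * sigma, S * t_comb
```

For two uncorrelated measurements with no theoretical or relative uncertainties, this reduces to the ordinary weighted average of Eq. 3.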
A Comparison with treating theoretical uncertainties on the same footing as statistical

Another approach to the present problem is simply to treat the theoretical uncertainties as if they were statistical. This procedure gives the same estimator for θ as above. However, the results for statistical and theoretical uncertainties differ in general. Let σ′ be the estimated statistical uncertainty on the average for this approach, and let t′ be the estimated theoretical uncertainty. Also, let T_ij be the covariance matrix for the theoretical uncertainties in this picture. Then the statistical and theoretical uncertainties on the average are given by:

  σ′² = (1/n) Σ_{i,j,k} M_ij W_jk / Σ_{i,j} W_ij,   (31)

  t′² = (1/n) Σ_{i,j,k} T_ij W_jk / Σ_{i,j} W_ij.   (32)

Note that the weights are given, as before, by

  (W⁻¹)_ij = (M + T)_ij.   (33)

That is, the weights are the same as in the earlier treatment, if the same assumptions about theoretical correlations are made in both places. By construction, σ′² + t′² = 1/Σ_{i,j} W_ij, the total variance on the average.

The estimates for the statistical and theoretical uncertainties differ between the two methods. That is, in general, σ′ ≠ σ and t′ ≠ t. The statistical uncertainty σ is computed from the individual statistical uncertainties according to simple error propagation. The statistical uncertainty σ′ is evaluated by identifying a piece of the overall quadratic combination of statistical and theoretical uncertainties as statistical. The difference between t and t′ is that t is computed as a weighted average of the individual t's, while t′ is evaluated by identifying a piece of the overall quadratic combination of statistical and theoretical uncertainties as theoretical. The approach for t is based on the notion that the theoretical uncertainties are estimates of bias, but with a conservative treatment of any unknown correlations. The t′ approach may be appropriate if the theoretical uncertainties are given a probabilistic interpretation.

Let's consider some possible special cases.
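Before turning to the special cases, the split of Eqs. 31-33 can be sketched numerically as follows (a sketch under the normalization written above; the function name and arguments are mine):

```python
import numpy as np

def split_sigma_t(M, T):
    """Partition of the total variance on the average into statistical
    and theoretical pieces, Eqs. 31-32, with W = (M + T)^(-1)."""
    M, T = np.asarray(M, float), np.asarray(T, float)
    n = len(M)
    W = np.linalg.inv(M + T)              # Eq. 33
    wsum = W.sum()
    sigma2 = (M @ W).sum() / (n * wsum)   # Eq. 31
    t2 = (T @ W).sum() / (n * wsum)       # Eq. 32
    # sigma2 + t2 = 1/wsum, the total variance of the average,
    # so this is a genuine partition of the combined uncertainty.
    return np.sqrt(sigma2), np.sqrt(t2)
```

With n identical uncorrelated measurements (M = σ²I, T = t²I) this reproduces t′ = t/√n.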
Suppose that all of the t_i's are the same, equal to t, and suppose that the theory uncertainties are presumed to be uncorrelated. In this case,

  t = t,   (34)
  t′ = t/√n,   (35)

where the t on the right-hand sides denotes the common individual value. Which is more reasonable? That depends on how we view the meaning of "uncorrelated" in our assumption, and on whether we assign a probabilistic
interpretation to the theoretical uncertainties. If we are supposing that the actual theoretical uncertainties are somehow randomly distributed in sign and magnitude, then it is reasonable to expect that the result will become more reliable as more numbers are averaged. However, if we consider the theoretical uncertainties as estimates of bias, which could in fact all have the same sign, then the weighted linear average is plausible. It is at least a more conservative approach in the absence of real information on the correlations. Note that if the correlation in theoretical uncertainty is actually known, the weighted linear average will take that into account. For example, suppose there are just two measurements, with t_1 = −t_2. If the weights are the same (that is, we also have σ_1 = σ_2), then t = 0. The other approach also gives t′ = 0.

A different illustrative case is when t_1 = 0, t_2 ≡ t ≠ 0, and M = diag(σ_1², σ_2²). In this case, we find

  θ̂ = θ̂′ = (x_1/σ_1² + x_2/(σ_2² + t²)) / (1/σ_1² + 1/(σ_2² + t²)),   (36)

  σ² = (1/σ_1² + σ_2²/(σ_2² + t²)²) / (1/σ_1² + 1/(σ_2² + t²))²,   (37)

  t = (t/(σ_2² + t²)) / (1/σ_1² + 1/(σ_2² + t²)) = t σ_1²/(σ_1² + σ_2² + t²),   (38)

  σ′² = (1/2)(1 + σ_2²/(σ_2² + t²)) / (1/σ_1² + 1/(σ_2² + t²)),   (39)

  t′² = (1/2)(t²/(σ_2² + t²)) / (1/σ_1² + 1/(σ_2² + t²)).   (40)

To understand the difference better, consider the limit in which t ≫ σ_1, σ_2:

  θ̂ = θ̂′ → x_1,   (41)
  σ → σ_1,   (42)
  t → σ_1²/t → 0,   (43)
  σ′ → σ_1/√2,   (44)
  t′ → σ_1/√2.   (45)

In this limit, both methods agree that the important information is in x_1. The first method assigns a statistical error corresponding to the statistical uncertainty of x_1, and a theoretical uncertainty of zero, reflecting the zero theoretical uncertainty on the x_1 measurement. The second method, however, assigns equal
statistical and theoretical uncertainties to the average. Their sum in quadrature is a plausible expression of the total uncertainty, but the breakdown into theoretical and statistical components is not reasonable.

Another limit we can take in this example is σ_2 ≪ t ≪ σ_1, obtaining:

  θ̂ = θ̂′ → x_2,   (46)
  σ → σ_2,   (47)
  t → t,   (48)
  σ′ → t/√2,   (49)
  t′ → t/√2.   (50)

Similar observations may be made in this case as in the previous one.
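The first limiting case above can be checked numerically. This sketch (variable names are mine, and a large but finite t stands in for the limit) evaluates method one via Eqs. 26 and 28, and method two via Eqs. 31-32:

```python
import numpy as np

# Limit t >> sigma1, sigma2 with t1 = 0, t2 = t.
sig1, sig2, t = 1.0, 1.0, 1.0e4
M = np.diag([sig1**2, sig2**2])
T = np.diag([0.0, t**2])
t_i = np.array([0.0, t])
W = np.linalg.inv(M + T)
wsum = W.sum()

sigma = np.sqrt((W @ M @ W).sum()) / wsum      # method 1: -> sigma1
t_comb = (W @ t_i).sum() / wsum                # method 1: -> 0
sigma_p = np.sqrt((M @ W).sum() / (2 * wsum))  # method 2: -> sigma1/sqrt(2)
t_p = np.sqrt((T @ W).sum() / (2 * wsum))      # method 2: -> sigma1/sqrt(2)
```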
arXiv:0806.0530v1 [physics.data-an] 3 Jun 2008

Averaging Results with Theory Uncertainties

F. C. Porter
Lauritsen Laboratory for High Energy Physics
California Institute of Technology
Pasadena, California
More informationIntroduction to Bayesian Learning. Machine Learning Fall 2018
Introduction to Bayesian Learning Machine Learning Fall 2018 1 What we have seen so far What does it mean to learn? Mistake-driven learning Learning by counting (and bounding) number of mistakes PAC learnability
More informationDetection and quantification capabilities
18.4.3.7 Detection and quantification capabilities Among the most important Performance Characteristics of the Chemical Measurement Process (CMP) are those that can serve as measures of the underlying
More informationPhysics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester
Physics 403 Parameter Estimation, Correlations, and Error Bars Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Review of Last Class Best Estimates and Reliability
More informationBayesian inference: what it means and why we care
Bayesian inference: what it means and why we care Robin J. Ryder Centre de Recherche en Mathématiques de la Décision Université Paris-Dauphine 6 November 2017 Mathematical Coffees Robin Ryder (Dauphine)
More informationClass President: A Network Approach to Popularity. Due July 18, 2014
Class President: A Network Approach to Popularity Due July 8, 24 Instructions. Due Fri, July 8 at :59 PM 2. Work in groups of up to 3 3. Type up the report, and submit as a pdf on D2L 4. Attach the code
More informationIntroduction: MLE, MAP, Bayesian reasoning (28/8/13)
STA561: Probabilistic machine learning Introduction: MLE, MAP, Bayesian reasoning (28/8/13) Lecturer: Barbara Engelhardt Scribes: K. Ulrich, J. Subramanian, N. Raval, J. O Hollaren 1 Classifiers In this
More information37-6 Watching the electrons (matter waves)
37-6 Watching the electrons (matter waves) 1 testing our proposition: the electrons go either through hole 1 or hole 2 add a very strong light source behind walls between two holes, electrons will scatter
More informationStrati cation in Multivariate Modeling
Strati cation in Multivariate Modeling Tihomir Asparouhov Muthen & Muthen Mplus Web Notes: No. 9 Version 2, December 16, 2004 1 The author is thankful to Bengt Muthen for his guidance, to Linda Muthen
More informationQuantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing
Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October
More informationMeasurement Error PHYS Introduction
PHYS 1301 Measurement Error Introduction We have confidence that a particular physics theory is telling us something interesting about the physical universe because we are able to test quantitatively its
More informationWeek 3: Linear Regression
Week 3: Linear Regression Instructor: Sergey Levine Recap In the previous lecture we saw how linear regression can solve the following problem: given a dataset D = {(x, y ),..., (x N, y N )}, learn to
More informationDiscrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10
EECS 70 Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 10 Introduction to Basic Discrete Probability In the last note we considered the probabilistic experiment where we flipped
More informationUnified approach to the classical statistical analysis of small signals
PHYSICAL REVIEW D VOLUME 57, NUMBER 7 1 APRIL 1998 Unified approach to the classical statistical analysis of small signals Gary J. Feldman * Department of Physics, Harvard University, Cambridge, Massachusetts
More informationUC Berkeley Department of Electrical Engineering and Computer Science Department of Statistics. EECS 281A / STAT 241A Statistical Learning Theory
UC Berkeley Department of Electrical Engineering and Computer Science Department of Statistics EECS 281A / STAT 241A Statistical Learning Theory Solutions to Problem Set 2 Fall 2011 Issued: Wednesday,
More informationDEMBSKI S SPECIFIED COMPLEXITY: A SIMPLE ERROR IN ARITHMETIC
DEMBSKI S SPECIFIED COMPLEXITY: A SIMPLE ERROR IN ARITHMETIC HOWARD A. LANDMAN Abstract. We show that the derivation of Dembski s formula for specified complexity contains a simple but enormous error,
More informationMeasurement Error PHYS Introduction
PHYS 1301 Measurement Error Introduction We have confidence that a particular physics theory is telling us something interesting about the physical universe because we are able to test quantitatively its
More informationFrank Porter February 26, 2013
116 Frank Porter February 26, 2013 Chapter 6 Hypothesis Tests Often, we want to address questions such as whether the possible observation of a new effect is really significant, or merely a chance fluctuation.
More informationAdvanced Statistical Methods. Lecture 6
Advanced Statistical Methods Lecture 6 Convergence distribution of M.-H. MCMC We denote the PDF estimated by the MCMC as. It has the property Convergence distribution After some time, the distribution
More informationContinuum Probability and Sets of Measure Zero
Chapter 3 Continuum Probability and Sets of Measure Zero In this chapter, we provide a motivation for using measure theory as a foundation for probability. It uses the example of random coin tossing to
More informationLecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012
Lecture 3: Linear Models Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector of observed
More informationAges of stellar populations from color-magnitude diagrams. Paul Baines. September 30, 2008
Ages of stellar populations from color-magnitude diagrams Paul Baines Department of Statistics Harvard University September 30, 2008 Context & Example Welcome! Today we will look at using hierarchical
More informationA Probability Review
A Probability Review Outline: A probability review Shorthand notation: RV stands for random variable EE 527, Detection and Estimation Theory, # 0b 1 A Probability Review Reading: Go over handouts 2 5 in
More informationECON 4160, Autumn term Lecture 1
ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least
More informationAn improved procedure for combining Type A and Type B components of measurement uncertainty
Int. J. Metrol. Qual. Eng. 4, 55 62 (2013) c EDP Sciences 2013 DOI: 10.1051/ijmqe/2012038 An improved procedure for combining Type A and Type B components of measurement uncertainty R. Willink Received:
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationLecture 24: Weighted and Generalized Least Squares
Lecture 24: Weighted and Generalized Least Squares 1 Weighted Least Squares When we use ordinary least squares to estimate linear regression, we minimize the mean squared error: MSE(b) = 1 n (Y i X i β)
More informationQuadrature for the Finite Free Convolution
Spectral Graph Theory Lecture 23 Quadrature for the Finite Free Convolution Daniel A. Spielman November 30, 205 Disclaimer These notes are not necessarily an accurate representation of what happened in
More informationHomework #2 Due Monday, April 18, 2012
12.540 Homework #2 Due Monday, April 18, 2012 Matlab solution codes are given in HW02_2012.m This code uses cells and contains the solutions to all the questions. Question 1: Non-linear estimation problem
More informationBrandon C. Kelly (Harvard Smithsonian Center for Astrophysics)
Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics) Probability quantifies randomness and uncertainty How do I estimate the normalization and logarithmic slope of a X ray continuum, assuming
More informationSimple groups and the classification of finite groups
Simple groups and the classification of finite groups 1 Finite groups of small order How can we describe all finite groups? Before we address this question, let s write down a list of all the finite groups
More informationA Bayesian Treatment of Linear Gaussian Regression
A Bayesian Treatment of Linear Gaussian Regression Frank Wood December 3, 2009 Bayesian Approach to Classical Linear Regression In classical linear regression we have the following model y β, σ 2, X N(Xβ,
More information32. STATISTICS. 32. Statistics 1
32. STATISTICS 32. Statistics 1 Revised April 1998 by F. James (CERN); February 2000 by R. Cousins (UCLA); October 2001, October 2003, and August 2005 by G. Cowan (RHUL). This chapter gives an overview
More informationChapter 7: Hypothesis testing
Chapter 7: Hypothesis testing Hypothesis testing is typically done based on the cumulative hazard function. Here we ll use the Nelson-Aalen estimate of the cumulative hazard. The survival function is used
More informationHandout 1 Systems of linear equations Gaussian elimination
06-21254 Mathematical Techniques for Computer Science The University of Birmingham Autumn Semester 2017 School of Computer Science c Achim Jung September 27, 2017 Handout 1 Systems of linear equations
More informationTesting Restrictions and Comparing Models
Econ. 513, Time Series Econometrics Fall 00 Chris Sims Testing Restrictions and Comparing Models 1. THE PROBLEM We consider here the problem of comparing two parametric models for the data X, defined by
More information